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Abstract A tree automatic structure is a structure whose domain can 
be encoded by a regular tree language such that each relation is recog- 
nisable by a finite automaton processing tuples of trees synchronously. 
Words can be regarded as specific simple trees and a structure is word 
automatic if it is encodable using only these trees. The question nat- 
urally arises whether a given tree automatic structure is already word 
automatic. We prove that this problem is decidable for tree automatic 
scattered linear orderings. Moreover, we show that in case of a posit- 
ive answer a word automatic presentation is computable from the tree 
automatic presentation. 



1 Introduction 

The fundamental idea of automatic structures can be traced back to the 1960s 
when Biichi, Elgot, Rabin, and others used finite automata to provide decision 
procedures for the first-order theory of Presburger arithmetic (N; +) and several 
other logical problems. Hodgson generalised this idea to the concept of auto- 
maton decidable first-order theories. Independently of Hodgson and inspired by 
the successful employment of finite automata and their methods in group theory, 
Khoussainov and Nerode [3] initiated the systematic investigation of automatic 
structures. Recalling the efforts from the 1960s, Blumensath [2] extended this 
concept notion beyond finite automata to finite automaton models recognising 
infinite words, finite trees, or infinite trees. 

Basically, a countable relational structure is tree automatic or tree automat- 
ically presentable if its elements can be encoded by finite trees in such a way that 
its domain and its relations are recognisable by finite automata processing either 
single trees or tuples of trees synchronously. A structure is word automatic if its 
elements can be encoded using only specific simple trees which effectively rep- 
resent words. In contrast to the more general concept of computable structures 
and based on the strong closure properties of recognisability, automatic struc- 
tures provide pleasant algorithmic features. In particular, they possess decidable 
first-order theories. 

Due to this latter fact, the concept of automatic structures gained a lot at- 
tention which led to noticeable progress (cf. |H6) b Automatic presentations were 
found for many structures, some structures where shown to be tree but not word 
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automatic, for instance Skolem arithmetic (N; x), whereas other structures, like 
the random graph, were proven to be neither word nor tree automatic. For some 
classes of structures it was even possible to characterise its automatic members, 
for example an ordinal is word automatic respectively tree automatic precisely 
if it is less than w w respectively oj u . Certain extensions of first-order logic were 
shown to preserve decidability of the corresponding theory. The question whether 
two automatic structures are isomorphic turned out to be highly undecidable in 
general as well as for some restricted classes of structures. At the same time, the 
isomorphism problem for word automatic ordinals was proven to be decidable. 
Last but not least, the different classes of automatic structures was characterised 
by means of interpretations in universal structures. 

Due to the fact that word automaticity is a special case of tree automaticity, 
the question naturally arises whether a given tree automatic structure is already 
word automatic. As far as we know, this problem was neither solved in general 
nor for any restricted class of structures. For that reason, we investigate the 
respective question for scattered linear orderings in this paper. Actually, we prove 
the corresponding problem to be decidable and our main result is as follows: 

Theorem 1.1. Given a tree automatic presentation V of a scattered linear or- 
dering £, it is decidable whether £ is word automatic. In case £ is word auto- 
matic, one can compute a word automatic presentation of & from V . 

Since every well-ordering is scattered, this result still holds if £ is assumed to be 
an ordinal. The proof of Theorem 11.11 splits into three parts. First, we introduce 
the notion of slim tree languages and prove this property to be decidable (The- 
orem [32]). Second, we show that a slim domain is sufficient for a tree automatic 
structure to be word automatic (Theorem 14. ip . Last, we demonstrate that this 
condition is also necessary in case of scattered linear orderings (Theorem 15. lj) . 
Altogether, Theorem 11.11 follows from the three mentioned theorems Q 

2 Background 

In this section we recall the necessary notions of logic, automatic structures 
(cf. |l|6j ). tree automata (cf. [3]), and linear orderings. We agree that the natural 
numbers N include and that [m, n] = {m, m + 1, . . . , n} C N for all m, n € N. 

Logic. A (relational) signature r — (1Z, ar) is a finite set 7Z of relation symbols 
together with a map ar: 1Z — > N assigning to each R G 1Z its arity ar(i?) > 1. 
A r-structure 21 = (A; {R % )r £ ti) consists of a set A = dom(2l), its domain, and 
an ar(i?)-ary relation i? a C ^4 al '( fl ) for each R 6 1Z. First order logic FO T over 
t is defined as usual, including an equality predicate. A sentence is a formula 
without free variables. Writing ip(x) means that all free variables of the formula 
<p are among the entries of the tuple x = (xi, . . . , x n ). The set </? a is comprised 
of all a £ A n satisfying 21 (= (p(a), where the latter is defined as usual. 

1 Proofs of all seemingly unproven lemmas as well as the interpretations from Section [4] 
can be found in the appendix. 



Title Suppressed Due to Excessive Length 3 



Automatic Structures. The set of all (finite) words over an alphabet £ is £*, the 
empty word is s, and the length of w is \w\. Subsets of £* are called languages 
and L C 17* is regular if it can be recognised by some (non-deterministic) finite 
automaton. 

Let □ £ be a new symbol and I7 n = 7i7U{n}. For n > 1 consider an n-tuple 
u) = (wi, . . . , w n ) G (£*) n of words with = a^ia^ . . . a^ mi for all i G 
Let m = max{mi, . . . , m„} and a^j = □ for j G [m, + 1, m]. The convolution 
of w) is the word ®w = a\...a m £ (^S)* with a~j = (aij, . . . ,a n j) £ 17™ 
for all j G [1,ti]. An n-ary relation R C (17*)" is automatic if the language 
<8>.R C (i^g)*, which is comprised of all ®to with w G -R, is regular. 

A r-structure 21 with dom(2l) C 17* is (word) automatic if dom(2l) is regular 
and i? 21 is automatic for all R G TZ. A (word) automatic presentation of 21 is a 
tuple (»4dom; (AfiOfleTi) of finite automata such that Adom recognises dom(2l) 
and Ar recognises ®R % . Abusing notation, we call any structure 03 which is 
isomorphic to some word automatic structure 21 also (word) automatic. 

Tree Automata. A tree domain is a non-empty, finite, and prefix-closed subset 
D C {0, 1}* satisfying uO G D iff ul G D for all u G D. A free over 17 is a map 
t: D — > £ where dom(£) = D is a tree domain. The set of all trees is denoted by 
Tjj and its subsets are called (tree) languages. For some t eTj and u G dom(i) 
the subtree of i rooted at u is the tree i |ti G Ts defined by 

dom(i|u) = { v G {0, 1}* | uv G dom(i) } and (t\u)(v) = t(uv) . 

A (deterministic bottom-up) tree automaton A — (Q,i,5,F) over £ consists of 
a finite set Q of states, a start state function t: £ — > Q, a transition function 
8:£xQxQ^fQ, and a set f C Q of accepting states. For each £ G Tj; a 
state .4(f) G Q is defined recursively by .A(t) = t(t(e)) if dom(f) = {e} and 
A(t) = S(t(e),A(t\0),A(t\l)) otherwise. The language recognised by A is the 
set of all t E Tjj with A(t) G F. A language L C is regular if it can be 
recognised by some tree automaton. 

The convolution of t — (t\,...,t n ) G (Tj;)™ is the tree ®i G Tx:g defined 
by dom(®£) = dom(ii) U • • • U dom(i„) and (®i)(u) = (^(u), . . . ,t' n (u)), where 
t'^u) = ti{u) if u G dom(fj) and t'^u) = □ otherwise. A relation R C {Ts) n is 
automatic if the language <g)i? C T^g is regular. 

Tree automatic structures and iree automatic presentations are defined like 
in the word automatic case, but based on trees and tree automata. 

Linear Orderings. A linear ordering is a structure 21 = (A; < a ) where < a is a 
strict linear order relation on A. The ordering 21 is scattered if (Q; <) cannot be 
embedded into 21. Obviously, every well-ordering is scattered. For any two linear 
orderings 21 and 03 we define another linear ordering 21 • 03 by dom(2l • 03) = 
dom(2l) x dom(03) and (ai,&i) < a <8 (02,62) iff either a\ < a 02 or ai = 02 and 
61 < <B &2- Finally, if 2li can be embedded into 03 1 and 2l 2 into 032, then 2li • 2I2 
can be embedded into 03i -032- 
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3 Slim and Fat Tree Languages 

In this section, we introduce the notion of slim tree languages and show that it 
is decidable whether the language recognised by a given tree automaton is slim. 

Definition 3.1. The thickness 0(t) of a tree t £ Ts is the maximal number of 
nodes on any level, i.e., 

0(t) = max { |dom(i) n {0, 1}* | | I > } £ N . 

For every K > 1 the set of all t £ Ts with 0{t) < K is denoted by Ts,k- 
A tree language L C Ts is slim if there exists some K > 1 such that L C Ts,k, 
otherwise L is fat. 

A tree automaton A is reduced if for every state q of A there is a tree t £ Ts 
with A(t) = q. For every tree automaton A one can compute a reduced tree 
automaton which recognises the same language and has no more states than A. 

Theorem 3.2. Given a reduced tree automaton A, it is decidable whether the 
tree language L recognised by A is slim or fat. If L is slim, then L C Ts^-\, 
where n is the number of states of A. 

For the rest of this section we fix a reduced tree automaton A = (Q, t, S, F). The 
proof of Theorem 13.21 essentially depends on an inspection of the directed graph 
G A = (Q,E A ) with 

(p,q) £ E A iff 3a £ E, r £ Q : 6(a,p, r) = q or 6(a, r,p) = q . (1) 

Clearly, this graph is computable from A. The lemma below is shown by applying 
the idea of pumping to tree automata. Therein, the height h(t) of a tree t £ Ts 
is the number 

h(t) = max { |tt| | u £ dom(i) } £ N . 
Lemma 3.3. For every q £ Q the following are equivalent: 

(1) there are infinitely many t £ Ts satisfying A(t) = q, 

(2) there is a tree t £ Ts satisfying h{t) > n and A(t) = q, where n = \Q\, 

(3) G A contains a cycle from which q is reachable. 

An edge (p, q) £ E A is special if in the definition of E A in Eq. ((T|) the state 
r £ Q can be chosen such that it satisfies the conditions of Lemma 13.31 (for r in 
place of q). Since condition (3) is decidable, it is decidable whether an edge is 
special. The key idea for proving Theorem l3.2l is stated by the following lemma: 

Lemma 3.4. The following are equivalent: 

(1) the tree language L recognised by A is fat, 

(2) there is a tree t £ L satisfying 0(t) > 2 n ~ 1 , where n — \Q\, 

(3) Ga contains a cycle including a special edge and from which F is reachable. 

The proof of this lemma works similar to the one of Lemma 13.31 Since condi- 
tion (3) is decidable given A as input, Theorem 13.21 follows . 
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4 Slim Tree Automatic Structures Are Word Automatic 

This section is devoted to the proof of the following theorem: 

Theorem 4.1. Let 21 be a tree automatic structure such that dom(2l) is slim. 
Then, 21 is already word automatic and one can compute a word automatic 
presentation o/2l from a tree automatic presentation o/2l. 

The idea of the proof is the following. Let K > 1 be such that dom(2l) C T^ : k- 
We give an alphabet S and a one-to-one map C: Te.k — > £*, the encoding, 
such that C(L) is regular for all regular L C Te.k (Proposition I4.4j) and C(R) 
is automatic for all automatic relations R C (Ts : k) u (Proposition 14. 6p . Thus, 
the structure C(2l) is word automatic. A word automatic presentation of C(2l) 
is computable since both propositions arc effective and Theorem 13.21 allows for 
computing a suitable K . Although it is possible to show both propositions using 
automata, it is much more convenient to accomplish this by means of logic. 

4.1 Monadic Second Order Logic 

Monadic second order logic MSO T extends F0 r by set variables, which range 
over subsets of the domain and are denoted by capital letters, quantifiers for 
these variables, and the formula "x G A" (cf. [7]). Let r = (1Z, ar) and r' be two 
signatures. An (MSO-) interpretation of a r-structure 21 in a T'-structure 03 is 
a pair (f,2) comprised of a one-to-one map /: dom(2l) — > dom(23) and a tuple 
I = (zi; { ( I > r)r£ti) of MSO T '-formulae with free FO-variables only such that 
/(dom(2l)) = A* and f(R % ) = for each R G U. In fact, / induces an 
isomorphism between 21 and 1(03) = (A* 8 ; ((fr^ ) n^n) ■ Replacing in an MSO T - 
formula <p(x) all symbols R G 1Z with <Pr and relativising quantifiers to A yields 
an MSO T '-formula <p x (x) satisfying 21 |= <p(a) iff 93 |= ip x (f(a)) for all a £ A n . 

For an alphabet S the signature W£ consists of one binary relation symbol < 
and a unary symbol P a for each a £ S. Every word w = a\02 ■ ■ ■ a\ w \ 6 S* is 
regarded as a WZ'-structure with domain dom(w) = {1, . . . , \w\}, < w being the 
natural order on dom(u;), and i 6 P™ iff a, = a. For fixed numbers m,r £ N, 
relations like x = y + m and x = r(modm) are expressible in MSOwi:- The 
language defined by an MSOws-sentence <P is the set of all w G U* with w \= <P. 

The signature TE is similar to WZ 1 but contains two binary symbols So and 
5*1 instead of <. Each tree t G Ts is considered as a TZ7-structure with domain 
dom(i), (u,v) G iff ud = v (d = 0, 1), and u G P l a iff t(u) — a. The language 
defined by some MSOj^-sentence <f> is the set of all t G with t\= <P. 

The following theorem holds for word languages as well as for tree languages: 

Theorem 4.2 (cf. |7J). A language L is regular iff it is definable in MSO, and 
both conversions, from automata to formulae and vice versa, are effective. 



6 Martin Huschenbett 



4.2 The Encoding and Preservation of Regularity 

For the rest of this section fix the K > 1 from above. The first a 
objective is to give the encoding C: T^.k — > where $ is \/ ^ c 

a new symbol and £ = S x {0, 1} U {$}. For a tree t G T£ )if c 7 X b b 7 \ 
of height m — h{t) its encoding C{t) — gqUi ...cr m is made /\ 
up of m + 1 blocks ctq, . . . , <7 m G 17 x describing the individual 
levels of t. More specifically, o~i consists of the labels of the £-th Figure 1. The 
level from left to right, each enriched by a bit stating whether ox ' 
the corresponding node possesses children, and is padded up to length if by $ 
symbols. For example, the tree t cx G T{ a jb)C \ in Figure Q] on the right satisfies 
0(tex) = 4 and is, under the assumption K = 5, encoded by the word 

C(t ex ) = (a,l)$$$$ (b,l)(c,l)$$$ (c,0)(b,l)(b,0)(a,0)$ (a,0)(c,0}$$$ . 

Formally, for each £ € [0, m] let Uii, . . . ,uj> lSi be the lexicographic enumeration 
(w.r.t. < 1) of dom(i) n {0, 1} £ . For r G [1, se] we let ce, r — 1 if ue >r is an inner 
node, i.e. ui^ r {0, 1} C dom(i), and q iT - = if M£ ir is a leaf. Finally, we put 

(T£ = (t(u ei l),C e ,l)(t(u e ^),Ci^) ■ ■ • (t(u£ iSe ),C t , se )$ K ~ Se . 

The main tool for studying the map C : Ts,k S* is the following lemma: 

Lemma 4.3. For all t G T^.k there is an MSO -interpretation (fc,Zc) oft 
in C(t) such thatXc does not depend on t. 

Proof. Observe that for each inner node u of t the children of u are the (2s — l)-th 
and 2s-th node on the next level, where s is the number of inner nodes from left 
up to u on its level. Formally, for an inner node we have Ue tr d — u^+i^s— x+di 
where d G {0, 1} and s — C£ t \ + • • • + c^ r . Based on this observation, one can 
give an interpretation (fc,Zc) of t in C(t) such that fc{ug, r ) — £ ■ K + r. □ 

As a first consequence, we obtain t = Ic{t) = Ic{t') = t', and hence t = t', 
for all t,t' G T S>K with C(t) = C(t'). Thus, the encoding C is one-to-one. The 
proof of Proposition 14.41 is mainly based on Lemma |4~31 and Lemma 1431 below. 

Proposition 4.4. Let L C T^j< be a regular language. Then, the language 
C(L) C U* is also regular and one can compute a finite automaton recognising 
C(L) from a tree automaton recognising L. 

Lemma 4.5. Let a G S* ■ There exists a tree t G Tjik with C{t) = cr iff 
a = o~qo~ i . . . o~ n for some n > and o~o, . . . , a n G S K satisfying (a) and (b): 

(a) o-£ — ct£.i . . . ai^ se $ K ~ Si for some sg > 1 and on t %, . . . , ctg ySt G S x {0, 1} and 
for each £ G [0, n], 

(b) s = 1, s t+ i = 2-(c£,H \-ci, Sl ) for < £ < n, andc mA ^ \-Cm,s m = 0, 

where ag, r = (cLt lT , C£ ir ) . 
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Proof. To see that C(t) has the required shape, notice that (b) mainly reflects the 
relationship between the numbers of nodes on two adjacent levels. Conversely, if 
a G E* is of the required shape, then there is a tree t G Ts with t = Xc((t) and 
it turns out that 0(t) < K and C(t) — ctH □ 

Proof (of Proposition^^. Let Fc be an MSO w £-sentence which expresses the 
requirement on the shape of a from Lemma 14.51 By Theorem 14.21 there is an 
MSOji;-sentence <P defining L C Ts,k- Then, the MSO w jj-sentence Pc A <P Xc 
defines C{L) and, again by Theorem I4.2[ this language is regular. Finally, all 
employed constructions are effective. □ 



4.3 Preservation of Automaticity 

The purpose of this subsection is to complete the proof of Theorem 14.11 

Proposition 4.6. Let R C (Ts^k) u be an automatic relation. Then, the rela- 
tion C{R) C (E*) n is also automatic and one can compute a finite automaton 
recognising ®C(R) from a tree automaton recognising ®R. 



Basically, the key idea behind the proof is the same as for Proposition l4.4l though 
it is more involved. Let t — (ti, . . . ,t n ) G {Ts,K) n - Due to cardinality reasons, 
®i is commonly not directly intcrprctable in (g>C(t) but only in an n-fold copy 
of Cg)C(i). This is formalised by means of the one-to-one monoid morphism 

The interpretation of ®i in H(®C(t)) embraces two aspects which are better 
considered separately. Thus, we define an intermediate structure Hi which ex- 
tends the disjoint union of the V s on domain dom(IIt) = (Jie[i n] W x dom(^) 
by a binary relation L ut , relating all (i,u) and (j,v) with \u\ = \v\, and unary 
relations = {i} x dom(ii) for each i G [l,n]. Altogether, we give several 
interpretations whose formulae naturally do not depend on the specific choice 
of t. An overview of the whole setting is depicted in Figure [21 

■ • ■ , tn) {fu ' lu) . . . , ti,' '■■ , tn) {f "' lH) >H{®(C{U), C(t n ))) 



Figure 2. Interpretations involved in proving Proposition 14.61 



The Interpretation (/u,Xu). The main idea is to construct an MSO-formula 
E{x,y) with Ut \= E((i,u), (j,v)) iff u = v. To achieve this, consider for each 
(i,u) G dom(Hi) the set Pre(i,w) of all (i,u') G dom(n^) where u' is a prefix 
of u. For (i,u), (j,v) G dom(IIt) we have u = v iff \u\ = \v\ and for all (i,u') G 



2 More details on this can be found in Appendix lB.21 
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Pre(i,u) and (j, v 1 ) £ Pre(j, v) with \u'\ = \v'\ > the last symbols of u' and 
v' coincide. Since the set Pre((i, u), X) is definable in MSO, we can express this 
characterisation in MSO as well. Heavily using the resulting formula E, one can 
construct an interpretation (fu,Iu) of <X>£~ in lit such that fu{u) — (hu), where 
i is minimal with u £ dom(ii). 

The Interpretations {f®,i^®,i) ■ For all i £ [l,w] and w £ [S*) n one can easily 
give an interpretation (/(g>,i,2®,i) of Wi in H(®w) such that = (p— l)-n+i. 

The Interpretation {Jh^h)- For i £ [1, n] let {fc,i^c) be the interpretation 
of £j in C(^) from Lemma 14.31 Since the /®,i's have mutually disjoint images, 
the map : dom(IIf) — > dom(iJ(®C(f))) with = f®s{fc,i{ u )) is one- 

to-one. For (i, u) £ dom(IIf) we get |u| • K < fc,i{u) < (\u\ + l) • K and hence 

\u\ ■ K ■ n < fu(i, u) < (\u\ + l) • K ■ n. 

Exploiting this observation for the formula L Xh and using Xc and 2®,*, one can 
construct formulae Ir such that {/ff,2jj) is an interpretation of Hi in H(®C(F)\ . 

Proof ( of Proposition \4-6\ ). Let F# be an MSO w £„ -sentence defining the lan- 
guage H{®{S*) n ) C (£5)*. If <P defines ®R, then 

' v iE[l,nJ 

defines H(®C(R)). Since if is a one-to-one monoid morphism, ®C(R) is regular 
as well. Finally, all employed constructions are effective. □ 

5 Fat Tree Automatic Ordinals Are Not Word Automatic 

The goal of this section is to give the last missing piece for the proof of The- 
orem [HU namely the following theorem: 

Theorem 5.1. Let £ be a tree automatic scattered linear ordering such that 
dom(£) is fat. Then, £ is not word automatic. 

The theorem below states the necessary condition on word automatic linear 
orderings we use to show non-automaticity: 

Theorem 5.2 (Khoussainov, Rubin, Stephan [5j). If 2, is a word automatic 
linear ordering, then its FC-rank is finite. 

Actually, we do not need any details on the FC-rank (finite condensation rank) 
besides the fact that every scattered linear ordering £, having the property that 
for each r > 1 at least one linear ordering from 

K = {*i'%2'-'&r I Mi,..., Or e{(N;<),(N;>)} } 

can be embedded into £, has infinite FC-rank. The main idea of the proof is as 
follows: 
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Lemma 5.3. Let £ = (L; <) be a tree automatic scattered linear ordering, 
(A; A <) an automatic presentation of Z, n the number of states of A, and r > 1. 
If there exists some tree t G L with 0(t) > r ■ 2 n , then there are infinite linear 
orderings 2li, . . . , 2l r suc/i i/iai 21 1 • 21 2 • • • Sir can oe embedded into £. 

For any linear ordering 21 and all cii,a 2 € dom(2l) we define cmp a (ai,a 2 ) G 
{ — 1, 0, 1} to be —1 if ai < a a 2 , if a\ = a 2 , and 1 if a 2 < a 01. To simplify nota- 
tion, we put [s, ij< = A< (<8>(s, i)) for all s,( 6 Tg. Moreover, we assume w.l.o.g. 
that from [s,t]< one can deduce whether s = t holds true. Then, cmp £ (s,i) is 
determined by [s,t]< for all s,t G L, i.e., there is a map / from the state set 
of A< to {-1,0,1} such that cmp z (s,t) = /([s,t]<) for all s,t e L. 

Proof. Let T £ i be a tree and I > n such that |dom(T) n {0, l} £ \ > r • 2". Thus, 
there exist at least r mutually distinct u G dom(T) n {0, 1} £_ " for which there is 
a v € {0, 1}™ with uv G dom(T), say u\, . . . ,u r . For t = (ti, ... ,t r ) G {T^Y let 
1[t\ G Ts be the tree obtained from T by replacing for each i G [l,r] the subtree 
rooted at Uj with f j. Then, _4,(X[f]) is determined by the r states A(ti), . . . , A(t r ) 
for all t G (T'x') r . Moreover, for s £ (T^) r the tree ®(T[s],T[f|) is obtained from 
®(T, T) by replacing for each i G [l,r] the subtree rooted at tt, with (g)(sj,ii). 
Consequently, [T[s], T[i]J is determined by the r states [si,£i]<, ■ ■ ■ , [s r ,i r ]< 
for aU s,t£ (T s ) r . 

Observe that hif£\ui) > n for each i £ [l,r]. Therefore, by Lemma [3.31 and 
Ramsey's theorem for infinite, undirected, finitely coloured graphs, there exists 
an infinite set Ai C Ts of trees t G Ts with A(t) = A(1\ui) such that 

c(s,t) = {[«,«]<,[*,*]<,[»,*]<, [*,«]<} 

is the same set Qi for all distinct s, t G Aj. It turns out that Qi has exactly three 
elements and [[s, s]< = dM]< for all s,t G A{. 

Now, put A = Ai x ■ ■ ■ x A r . For each t G A we have -4.(3T[t]) = -4,(T) and 
hence 1[t\ G £. We define a linear ordering 21 = (A; < a ) by s < a i iff T[s] < 1[t\. 
By definition, 21 can be embedded into £. 

For i G [l,r], a G A, and i G we let G A be the tuple a with 
the i-th component replaced by t. Then, for all a, 6 and s, t G Ai we obtain 
[T[a l/s ],T[a l/t ]]] < = [T[6 i/s ], < and hence a l/s < a a i/t iff 6 i/s < a b i/t . 
Thus, defining a linear ordering 2U = (Af, < ai ) by s < a * £ iff < a aj/ 4 is 
independent from the specific choice of a G A. Clearly, cmp a . (s, i) is determined 
by [s,i]< for all s,t G Aj. Since Qi contains exactly three elements, [s, t]< is 
determined by cmp a .(s,i) for all s,t G Ai as well. Hence, the linear orderings 
21 and 2li, . . . , 2l r satisfy the condition of Lemma 15.41 below and consequently 
2l w (i) • • • 2l ff ( r ) can be embedded into £. □ 

Lemma 5.4. Let 21 and 2li, . . . ,2l r be infinite linear orderings with dom(2l) = 
dom(2li) x • ■ ■ x dom(2l r ) and satisfying the following two conditions: 

(1) cmp a (o, b) is determined by cmp 21l (ai, 61), ... , cmp ar (a r , b r ) for all a,b G A, 

(2) ifa,b£A differ only in the i-th component, then cmp a (a, b) — cmp a . (a*, bi). 
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Then, there exists a permutation it of {I, ... ,r} such that 21 is isomorphic to 

2tir(l) ' %v(2) ■ ■ ■ 2Lr( r ). 

Finally, we are in a position to prove Theorem 15. II 

Proof (of Theorem \5.1\) . Let (A.;A<) be an automatic presentation of £ and n 
the number of states of A. Since dom(£) is fat, for any r > 1 there is a t € dom(£) 
with 0(t) > r-2 n . Let 2li, . . . , 2l r be the infinite linear orderings from Lemma 15751 
For each i G [1, r] some 25^ G {(N; <), (N; >)} can be embedded into 2L. Then, 
«g 1 .*g 2 . . . Q5 r G J\f r can be embedded into 21 1 • 2b • • • 2L and consequently into £. 
Hence, £ has infinite FC-rank and is, by Theorem 15.21 not word automatic. □ 

6 Conclusions 

Altogether, we proved that is decidable whether a given tree automatic scattered 
linear ordering is already word automatic. Taking a closer look at the proof 
reveals that the problem is solvable nondeterministically in logarithmic space, 
provided the tree automaton recognising the domain is reduced. 

The restriction to scattered linear orderings naturally rises the question 
whether this result holds true for general linear orderings. Unfortunately, this 
problem cannot be solved by means of our technique since the ordering (Q; <) 
of the rationals admits a word automatic as well as a fat tree automatic present- 
ation. As the Boolean algebra of finite and co-finite subsets of N shares this 
feature, the same pertains to an analogue of Theorem 11.11 for Boolean algeb- 
ras. In spite of that, we suggest trying to apply the technique to other classes 
of structures, such as groups, for which a necessary condition on its automatic 
members is known. 

Finally, Theorem 11.11 provides a decidable characterisation of all tree auto- 
matic ordinals a > w w . Finding such a characterisation for each w w with k £ N 
possibly turns out to be the main ingredient for showing that the isomorphism 
problem for tree automatic ordinals is decidable. 
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A Proofs of Lemmas 13.31 and 13.41 

Recall that we fixed a reduced tree automaton A = (Q, l, S, F) and defined the 
graph G A = (Q, E A ) by 

(p,q) G E A iff 3a G S,r G Q : 5(a,p,r) = q or 6(a,r,p) = q. §Q 

An edge (p, q) G E A was called special if in Eq. ([1]) the state r G Q can be 
chosen such that it satisfies the conditions of Lemma \'S. 31 Moreover, we denote 
the prefix relation on {0, 1}* by ^, i.e., u -< v if there is some w G {0, 1}* such 
that uw = v. 

To simplify notation, we put t[u] — A(t\u) for each t G Ts and it G dom(i). 
In particular, t[e] = A(t). For all u G dom(t) with u0,ul G dom(i) we have 
S(t(u),t[uO],t[ulfj = t[u] and hence (t[u<i], t[u\) G -EU for both d = and d = 1. 
We denote these edges by t[ud,u\. Applying this argument repeatedly, for all 
u,v G dom(i) with u ^< v we obtain a path from t[v] to of lengtto \v\ — \u\ 
in G A , which we denote by t[i;, u]. 

Conversely, let (p, q) G -E^i be an edge in G A and t G Ts a tree with ^l(t) = p. 
Then, there are a G S and r £ Q such that, w.l.o.g., S(a,p,r) = q. Moreover, 
there is a tree s G Ts with »4(s) = r. Then, the unique tree G Tu with 
<'(£•) = a, f'fO = and t'\l = s satisfies A(t') — q and t'\0 — t. Applying this 
argument repeatedly, we obtain for each path from p to q of length m in G A and 
any tree t G Ts with A(t) = p another tree t' G Ts and a position u G dom(t') 
such that A(t') = q, \u\ = m, and t'\u = t. 

Lemma 3.3 For every q G Q the following are equivalent: 

(1) there are infinitely many t 6 Ts satisfying A(t) = q, 

(2) there is a tree t G Ts satisfying h{t) > n and A(t) — q, where n = \Q\, 

(3) Ga contains a cycle from which q is reachable. 

Proof. Trivially, (1) implies (2). It remains to show that (2) implies (3) and 
(3) implies (1). 

To (2) implies (3). Let t G Ts be a tree with h(t) > n and A(t) = q. Consider 
some u G dom(i) such that |tt| = n. Then, t[u, e] is a path of length n ending 
in q. Due to the pigeonhole principle, this path contains a cycle. 

To (3) implies (1). It suffices to show that for each m > there is a tree t G Ts 
with A(t) = q and h(t) > m. Thus, consider some m > 0. There is a path of 
length m ending in q. Let p G Q be the first state of this path and s G Ts a tree 
with v4(s) = p. Then, there are a tree t G Ts and u G dom(i) with .A(t) = q, 
|tt| = m, and t\u = s. In particular, h(t) > m. □ 



3 The length of a path is the number of its edges. 



12 Martin Huschenbett 



Lemma 3.4 The following are equivalent: 

(1) the tree language L recognised by A is fat, 

(2) there is a tree t £ L satisfying 0{t) > 2 n ~ 1 , where n — \Q\, 

(3) Ga contains a cycle including a special edge and from which F is reachable. 

Proof. Trivially, (1) implies (2). It remains to show that (2) implies (3) and 
(3) implies (1). 

To (2) implies (3). Using induction on m > we show the following: For every 
tree t £ T s with 0{t) > 2" 1 " 1 and \Q t \ < m, where 

Qt — { t[u] | u £ dom(r) } , 

there are ui,u% £ dom(i) such that U\ -< U2 and t\u%, u{\ is a cycle containing a 
special edge. 

For to = there is nothing to show. Thus, let m > 0. Consider an £ > 
such that \U\ > 2" 1 " 1 for U = dom(t) n {0, l}f Let u £ dom(<) be the longest 
common prefix of all positions in U. Clearly, £ > \u\ + m. There are two cases: 

1. There is a v £ dom(t) with u -< v and t[u] = t[v]. W.l.o.g., we assume uQ ^< v. 
By the choice of u, there is some w £ U with ul ^ w. The path t[w,u] has 
length I — \u\ > m and hence contains a cycle. Since t[ul] lies on or after 
this cycle, i.e., t[ul] satisfies condition (3) of Lemma 13.31 Thus, the edge 
£[u0,u] £ Ea-, which is contained in the cycle t[v, u], is special. 

2. There is no v £ dom(£) with u <v and t[u] = t[v]. In particular, 2 < \Q t \ < 
m. Since 0{t\u) > \U\ > 2™- 1 , we have 0(t\uO) > 2™" 2 or 0{t\ul) > 2 m ~ 2 . 
W.l.o.g., assume 0(s) > 2 m ~ 2 for s = t\u0. We have t[u] £ Q s and hence 
|Qs| < \Qt \ < m. By the induction hypothesis, there are v, w £ dom(s) such 
that v -< w and s[w, v] is a cycle containing a special edge. The claim follows 
from uOv -< uOw and t[u0w,u0v] = s[w,v]. 

Finally, consider some t £ L(A) with 0{t) > 2™ _1 . Obviously, \Q t \ < \Q\ = n. 
From the cycle t[u2, u±] we can reach a state from F along the path t[ui, e]. 

To (3) implies (1). Using induction on to > we show that if there is a path 
containing m special edges and which ends in some q £ Q, then there is a tree 
t £ Ts with A(t) = q and 0{t) > to. Due to the cycle there are paths containing 
arbitrarily many special edges and which end in F. Thus, condition (1) will 
follow. 

For to = any tree t £ Ts with A(t) = q trivially satisfies 0(t) > 0. Thus, 
consider to > 0. Let (p, r) £ £U be the last special edge in the path. By the 
induction hypothesis, there is a tree s £ Ts with A{s) — p and 0(s) > m — 1. 
Let £ > be such that | dom(s) n {0, l} e \ — 0{s). Moreover, there are a £ S 
and p' £ Q such that, w.l.o.g., 6(a,p,p') — r and p' satisfies the conditions of 
Lemma [3~3l Thus, there is a tree s' £ Ts such that h(s') > £. Then, the unique 
tree t 1 £ Ts with t'(e) = a, t' \0 = s, and t' \l = s' satisfies A(t') = r. Since there 



Title Suppressed Due to Excessive Length 13 



is a path from r to q, say it has n edges, there are t G Ts and u S dom(i) such 
that A{t) — q, \u\ — n, and t\u = t' . From the construction of t we obtain 

dom(i) n {0, l} n+1+e D u (dom(t / ) n {0, 1} W ) 

= ?i0(dom(s) n {0, 1}*) U ul(dom(s') n {0, 1}^) . 

Since the union on the right hand side is disjoint and the set dom(s') n {0, l} e 
is not empty, we have 

|dom(t) n{0,l}" +1+£ | > |dom(s)n{0,l} £ | + |(dom(s')n{0,l} £ | >m, 

i.e., 0(t) > to. □ 
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B Interpretations and Formulae from Section [4] 
B.l The Interpretation (fc,Xc) 

Lemma 4.3 For all t £ T^.k there is an MSO -interpretation (fc,Ic) of t 
in C(t) such thatXc does not depend on t. 

The formulae X c = {A c ; {S^ a )de{o,i}, (Pa°)aes) are as follows: 
A c (x) = ^P${x) 

Sd°(x, V) = \/ aeS P(aM x ) A A c(v) A 3z(y>(x, z) A V 1<s<f ^> z )) 

P a Ic (x) = A c {x) A (P (a ,o)(z) V P {aA) (x)) 
tp(x, z) = z = 1 (mod if) Az<i<z + X 

V»<j,«(a;,j/,z) = 3 =s z'(z < z < x A\J s P( a ,i)(z')) Ay = z + K + 2s-2 + d 

(l<*<f) 

B.2 Proof of Lemma 1331 

Lemma 4.5 Lei cr <= Z 1 *. There exists a tree t £ T^.k with C(t) — a iff a = 
<7o<7i . . . cr„ /or some n > and o~o, . . . , o~ n £ £ satisfying (a) and (b): 

(a) o-g — a^i . . . oc^ st $ 1 for some sg > 1 and ao, . . . , ag, Se £ £ x {0, 1} and 
for each £ £ [0, n], 

(b) s = 1, s i+ i = 2 ■ H \-ce,s t ) for0<£<n, and c„,,i H l-c m) g ro = 0, 

where ae, r = {ae. r ,C£. r ). 

Proof. To see that C(t) has the required shape, choose n = m, o~i and si as in 
the construction of C(t), and a^ r = (t(u^ iT .), Q )T .). Then, condition (a) is trivially 
met, whereas (b) is satisfied since each tree has exactly one node on the zeroth 
level, on each other level twice as many nodes as inner nodes on the previous 
level, and no inner nodes on the last level. 

Conversely, consider some a = ot\ . . - (Xfn+u.K G ^J* 01 the required shape. 
Let T = Xc {&) and T = dom(1). First, we observe that 

T = {l-K + r \l£ [0,n],r e [l,s/] } 

and put (a p , c p ) = a p for all p £T. For p = i ■ K + r 7 q £ T and d £ {0, 1} with 
I £ [0, n] and r £ [1, s e ] we have (p, q) £ 5f iff c p = 1 and q = (£+l)-K+2s-l+d, 
where s — C£.k+i + ■ ■ - + c p . Notice that this reflects the introductory observation 
in the proof of Lemma 14.31 

Second, we construct a map f:T—> {0, 1}*. We define the value f(q) by 
induction on q £ T. The resulting map will satisfy \f{q)\ = £ for all q £ T 
with £ ■ K < q < (£ + 1) • K . The first condition of (b) yields q = 1 or q > K. 
We put /(l) = e. For q > K there are unique £, r £ N with < £ < n and 
1 < s < ^±1, and d £ {0, 1} such that q = (£ + 1) • K + 2s - 1 + d. From the 
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second condition of (b) we conclude s < c^.k+i + ■ ■ ■ + ce-K+s e - Thus, there is a 

least p E [£ ■ K + 1, £ ■ K + si] such that s — c^.k+i H h c p and the minimality 

implies c p = 1. Since p < q, we are allowed to put f(q) = f{p)d. 

A simple but tedious inspection of this construction shows for all p,q G T that 
p < q iff f(p) <iiex /(?), where <ii ex is the length-lexicographic order on {0, 1}*. 
In particular, / is one-to-one. Obviously, the set D = f(T) is non-empty and 
finite. Due to the construction of / it is also prefix-closed. For uO G D we have 
/ _1 (u0) + 1 G T and ul = /(/ _1 (u0) + l) G D. Similarly, til g D implies 
uO G D. Thus, Z) is a tree domain. We define a tree t G T s by dom(t) = Z) and 
= aj-i( u ). It turns out that / induces an isomorphism /: 1 — \ t between 
T£-structures. 

From the earlier remark on /(g) | we conclude 0(t) < K. Moreover, h(t) = n 
and f{£ ■ K + 1), . . . , f(£ ■ K + se) is the lexicographic enumeration of dom(f) n 
{0, 1} for all < £ < n. For p G T with p < n ■ K the specific choice of / 
yields c p = 1 iff f(p){0, 1} C dom(t). For p > n - K this also holds true since the 
third condition of (b) implies c p = and \ f(p)d\ > n implies f(p)d dom(t) for 
d G {0, 1}. Altogether, we obtain C(t) = a. □ 

B.3 The Sentence T c 

The sentence Fc is denned as follows: 

F c — &\ A <P 2 A ^3 A ^ 4 A <P 5 
$i = 3x(Vy(y < x) A x = K (mod K)) 
<P 2 = Vx(x = 1 (mod K) -> Ely <p(x, y)) 
= 3x(ir = 1 A <^(ar, or)) 



(fi(x,y) = x<y<x + KA Vz(x < z < x + K (P$(z) 4-> y < z)) 
ip s (x) = 3j/i3j/ 2 (<p(a;i,yi) A ^{x^yi) A% 5 (a;i,yi) Ay 2 = x 2 + 2s - l) 



The purpose of the formulae <Pi is as follows: 

<P\ — the length \a\ of cr is a positive multiple of K, say \a\ = (m + 1) • K 

<!>i — a can be written as a — ao ■ ■ ■ &m with <r , ■ ■ ■ , <r m G S K such that (a) is 

satisfied, therein ip(£ ■ K + l,p) holds precisely for p = £ ■ K + si 
<p3 — the first condition of (b) is satisfied 
$4 — the second condition of (b) is satisfied 
(p 5 — the third condition of (b) is satisfied 





(l<*<f) 

(0 < 8 < f ) 
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B.4 The Formula E and the Interpretation (/it,Xij) 

The formula E{x\,x 2 ) is defined as follows: 

E(xi,x 2 ) = L(x!,x 2 ) A3X 1 3X 2 (ip Pre (xi,X 1 ) Aip Pre (x 2 ,X 2 ) Aip(X 1 ,X 2 )) 
ip Ple {x, X) = <pci(x, X) A VY(<pci(x, Y) -> Vy(y eX -+ y eY)) 
ipci{x, X) = x G X A VyVz(S'(y, z) A z G X -> y G X) 

S(l/,«) = 5 (»,«)VSi(y,«) 
^(X 1; X 2 ) = V-ZiV^i € Xi Az 2 G X 2 A L( Zl ,z 2 ) A (3yS(y, Zl )) -»• x(z 1; z 2 )) 

X^i,^) = 3yi3y 2 ((S'o(yi,2i) A 5 (y 2 ,^ 2 )) V (Si(j/i,2i) A Si(y 2 ,z 2 ))) 
The ideas behind these formulae are the following: 

<pci(x,X) — the set X contains x and is closed under taking predecessors 
</3p rc (x, X) — the set X is the smallest one (w.r.t. inclusion) having the property 

<pc\(x, X) and hence Ut \= <pp le ({i, u), If) iff U = Pre(z, u) 
%/j(Xi,X 2 ) — describes the condition on Pre(i,u) and Pre(j, v) in the character- 
isation of | it | = \v\ in terms of these two sets 

Lemma B.l. For all i G (Tj) n there is an MSO -interpretation (fu,%u) of ®i 
in Hi such that fu(u) = {i,u), where i is minimal with u G dom(ij), and Xu 
does not depend on t. 

The formulae X u = (a Xu ; { S d U ) d e{o 1} , (-Pa In ) S£ £ s ) are as follows: 

A iu(x)=\J (Qi(x) A/\ <pj,n{xj) 
Sj" (or, y) = A In (x) A A Xu (y) A 3z(S d (z, y) A E(x, z)) 

<p i , a {x) = 3y(Q i {y)AP a {y)AE{x,y)) (i G [l,n],a6£) 

= -3y(Qi(y) AE(x,y)) (i G [l,n]) 

B.5 The Interpretations {f®,i,I®,i) 

Lemma B.2. For each i G [l,n] and aZZ u) G (I?*)™ t/iere is an M SO -interpre- 
tation (f®,i,Z®,i) of Wi in H(®w) such that f®s(p) = (p — 1) • n + i and 1®^ 
does not depend on w. 

The formulae X®,, = (^A X0i ; < 1 ®- i , (Pa®'') ae gj are as follows: 

Aia,i(x) = x = i (modn) A -iipi, a (x) 
<x»„ ( Xj y) = (x ) A (y) Ax <y 

1>i,a(*) =V- r (ae£ a ) 
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B.6 The Interpretation 

Lemma B.3. For all t G (Ts,K) n there is an MSO -interpretation (/j?,Xh-) 
of lit in H(®C(t)) such that 2 h does not depend on t. 

The formulae I„ = (a Ih ; (^) d£{01} , {P?») a€E , I?» , (Q?")^) areas 
follows: 

S*»{x,y) = \J ^(A^M A A T ^{y) A (S^f^(x, y)) 

P X a H (*) = \J 1<i<n {A x ^(x) A (P^)^(x)) 
L lH {x 1 y) = A Th (x) A Ai H {y) A 3z(z = f (mod if ■ n) A z < x,y < z + K ■ n) 

Qf H (x) = A x ^(x) 



B.7 The Sentence r H 

The sentence is defined as follows: 

r H = $ A A 

<£> = ~ixiy(x = I (modn) A x < y < x + n (P a {x) A P a (y))^J 

Vi = Mx\/y{x <yAtp hD (x) -> ipi, n (y)) (i G [l,n]) 

where ^>j iD is the formula from the construction of X®^ for i G [1, n]. The formula 
defines the set iJ((l7™)*), whereas ^ defines the set of all words w G (-^S)* 
with 7Ti(w) G Z"*n*, where 7Tj : (£□)* ~^ (^Vj)* is the projection to the i-th 
component. Thus, the conjunction Aie[i n ] ^ defines the language ®{S*) n . Since 
H(®(E*) n ) = H((E™)*) n r H defines this set. 
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C Proof of Lemma 15.41 

Lemma 5.4 Let 21 and 2ti, . . . ,2l r be infinite linear orderings with dom(2l) = 
dom(2li) x ■ ■ ■ x dom(2t r ) and satisfying the following two conditions: 

(1) cmp a (a, 6) is determined by cmp 21l (oi, b\) 1 . . . , cmp ar (a r , b r ) for all a,b e A, 

(2) if a, b G A differ only in the i-th component, then cmp g (a, b) = cmp a . (dj, 6j). 

Then, there exists a permutation tt of {1, ... , r} smc/i i/ia£ 21 is isomorphic to 
21tt(i) • 21tt(2) • • • 2t^(r)- 

Proof. Let 21 = (A; <) and 2li = (Af, <,) for each z G Consider a, 6 G A 

with ai <, 6, for all i G [1, r]. If we put = (&i, . . . , bk, o-k+i, ■ • ■ , a r ) and apply 
condition (2) repeatedly, we obtain 

a,c(»)<c-( 1 )<...< c -M=6. (*) 

For the rest of this proof, fix x, y, z G A such that xi <, <\ Zj for all 1 < i < r 
and put = aty^. Since the are mutually distinct, there is a permutation 
7r of {1, . . . , r} such that 

eW 1 )) > gM 2 » > ... > e Wr)) . 

Due to condition (1), this permutation does not depend on the specific choice 
of x and y as long as Xi <, j/j for all i G [1, r]. To see that this permutation has 
the desired property, we need show that the one-to-one correspondence 

/: 21 -)• 2l w (i) • •■2t w ( r ), (ai, . . . , a r ) \-t (a 

is an isomorphism. However, to keep notation clear, we simply assume that 7r is 
the identity. 

Therefore, it suffices to show that a < a i ,,,a '- 5 implies a < b for all a,b G A. 
Thus, consider a, 6 G A satisfying the premise. Due to the definition of 21 1 • • • 2l r , 
there exists a /c G [1, r] such that Ok <k bk and = bi for all i G [1, k — 1]. For 
all i G [1, r] we let = min 2ti (a 4 , 6j), ft, = max a , (a,, &*) 

c (i) = (9x,--.,9i,hi + x,...,hr) G A, 

and 

d w = (9i,---,9i-i,hi,gi+i,...,g r ) 6 A. 

Using induction, we show a' 1 ' < dW for all i G [1, r] with / /ij. Clearly, <?j ^ ftj 
implies <7j <j /i*. For the largest i with this property, c^ < dW directly follows 
from condition (2). For all other i with ^ hi consider the least j > i with 
gj 7^ /ij . We obtain the following chain of inequalities 

c [i ) =#} ., <c U , ] .. <S J , ) ., , 

i/Xi,3/yj VBirf/Vi l/Vi,3/Xj ^/Vi,3/Vj */*i>3/x} ' 
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where the first and last inequality are due to e^> < ey> and condition (1), 
whereas the second one is implied by the induction hypothesis < dV> and (1). 
Finally, a last application of (1) yields < dvK Altogether, we obtain 

a < c {k) < d {k) < b , 



where both non-strict inequalities use Q. 



□ 



